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Abstract — We  study  the  resequencing  delay  caused  by  multi- 
path  routing.  We  use  a  queueing  model  which  consists  of  parallel 
queues  to  model  the  network  routing  behavior.  We  define  a  new 
metric,  denoted  by  7,  to  study  the  impact  of  resequencing  on 
the  customer  end-to-end  delay.  Our  results  characterize  some 
properties  of  7  with  respect  to  different  service  time  distributions. 
In  particular,  the  resequencing  delay  can  be  negligible  when  the 
delay  along  each  path  is  light-tailed,  but  can  be  of  major  concern 
when  it  is  heavy-tailed. 

I.  Introduction 
A.  Resequencing  delay 

Multipath  routing  has  recently  received  a  lot  of  attention 
in  the  context  of  both  wired  and  wireless  communication 
networks.  By  sending  data  packets  along  different  paths, 
multipath  routing  can  potentially  help  balance  the  traffic  load 
and  reduce  congestion  levels  in  the  network,  in  the  process 
resulting  in  lower  end-to-end  delay  and  higher  throughput. 
This  path  diversity  also  increases  the  ability  of  the  network 
to  adapt  to  link  failures,  an  important  issue  in  wireless  ad  hoc 
networks  where  the  topology  often  changes  [10,  16]. 

There  are  already  network  layer  protocols  that  provide 
multipath  routing,  e.g.,  TORA  [12]  for  MANETs,  and  trans¬ 
port  layer  protocols  supporting  multipath  routing  are  currently 
under  development.  For  example,  the  Stream  Control  Trans¬ 
mission  Protocol  (SCTP)  with  multihoming  is  one  approach 
for  supporting  stream  applications,  such  as  multimedia,  using 
multipath  routing.  Under  SCTP,  multiple  IP  addresses  are 
associated  with  one  client.  The  connection  between  these 
different  IP  and  the  server  IP  are  established  on  different 
routes.  As  with  TCP,  SCTP  is  responsible  for  congestion 
control  and  assuring  packets  arrival  order. 

Since  consecutive  packets  travel  possibly  along  different 
paths  from  source  to  destination,  they  can  easily  be  mis- 
ordered,  i.e.,  received  out-of-order,  at  the  destination.  If  the 
application  requires  the  packets  to  be  processed  in  a  certain 
order  at  the  destination,  e.g.,  say  the  order  in  which  they 
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were  sent,  then  the  mis -ordered  packets  have  to  wait  an 
additional  amount  of  time,  known  as  the  resequencing  delay, 
before  being  consumed.  If  this  resequencing  delay  is  too  large, 
it  can  significantly  degrade  the  performance  of  some  real 
time  streaming  applications.  The  throughput  provided  by  the 
network  layer  may  be  high,  but  the  instantaneous  throughput 
measured  by  applications  may  not  be  satisfactory  and  may 
result  in  a  poor  end-to-end  user  experience. 

Usually,  the  impact  of  mis-ordering  is  evaluated  by  com¬ 
paring  the  expected  resequencing  delay  against  the  expected 
(system)  end-to-end  delay  [14,  7].  These  first  order  statistics 
give  some  information  concerning  the  average  behavior,  but 
are  only  crude  indicators  of  the  more  subtle  interactions 
taking  place.  Here,  we  seek  to  evaluate  the  relative  importance 
of  the  resequencing  and  end-to-end  delays  from  a  different 
perspective.  We  do  so  by  means  of  a  new  metric  that  measures 
the  conditional  probability  of  the  resquencing  delay  given  the 
end-to-end  delay;  see  below  for  a  more  detailed  description. 

B.  A  simple  model  of  mis-ordering  and  multi-path  routing 

We  shall  evaluate  this  new  metric  in  the  context  of  a  simple 
queueing  model  with  mis-ordering,  namely  a  set  of  K  parallel 
single  server  queueing  stations  -\GI |1  fed  by  Poisson  arrivals 
under  probabilistic  state-independent  routing.  In  short,  the 
discussion  will  be  given  for  a  queueing  system  comprising 
a  set  of  K  parallel  M\GI\1  queues. 

This  model  constitutes  an  ersatz  of  the  very  complex 
situation  we  seek  to  investigate:  The  multiple  paths  between 
a  given  source/destination  pair  correspond  to  a  set  of  parallel 
stations  with  each  source/destination  path  represented  by  a 
single  server  queue.  While  many  of  the  details  of  the  protocol 
have  been  eliminated,  the  essence  of  network  behavior  (i.e., 
mis-ordering)  has  been  preserved.  By  controlling  the  service 
time  distribution  at  a  station,  we  can  model  the  delay  of  the 
packets  traveling  along  that  path.  When  a  packet  is  generated 
at  the  source,  it  is  sent  out  along  one  of  the  paths.  Upon 
reaching  the  destination,  out-of-order  packets  are  stored  in  a 
resequencing  buffer  until  they  satisfy  the  ordering  condition 
of  the  application,  at  which  point  they  leave  the  resequencing 
buffer. 

The  study  of  resequencing  problem  in  queueing  networks  is 
not  without  history  [5,  8,  2,  4,  13,  7].  Recently,  large  deviations 


results  have  been  obtained  for  the  size  of  the  resequencing 
buffer  for  a  system  of  two  parallel  M\M\1  queues  [15].  The 
analysis  given  here  has  its  point  of  departure  in  the  work  of 
[8]  where  the  distributions  of  the  resequencing  and  end-to- 
end  delays  were  derived  for  a  system  of  parallel  M\GI\1 
queues.  Unfortunately,  most  of  the  closed-form  results  rely 
on  the  service  times  being  exponentially  distributed,  while 
general  service  time  distributions  do  not  always  give  nice 
form  solutions.  For  our  purpose  this  limits  somewhat  the 
usefulness  of  the  results  from  [8]  since  in  many  circumstances, 
it  may  not  be  appropriate  to  assume  that  the  service  times  are 
exponentially  distributed.  For  instance,  in  the  Internet,  traffic 
coming  to  a  router  arises  as  the  aggregation  of  many  flows 
from  many  different  sources,  and  the  impact  of  the  admission 
control  and  scheduling  policies  is  far  from  understood.  It  is 
therefore  unrealistic  to  assume  that  packet  delays  along  a  path 
are  exponentially  distributed.  Thus,  in  the  name  of  model 
robustness  (and  ignorance),  it  seems  appropriate  to  make  as 
few  assumptions  on  the  service  time  distributions  as  possible. 

C.  A  new  metric 

Let  R  and  D  denote  the  (stationary)  packet  resequencing 
and  end-to-end  delays,  respectively.  The  question  of  interest 
here  is  as  follows:  If  a  packet  experiences  a  long  end- 
to-end  delay,  is  this  caused  by  resequencing?  One  way  to 
approach  this  issue  would  be  to  study  the  joint  distribution 
of  R  and  D,  in  the  process  going  one  step  further  than  the 
results  on  the  individual  distributions  currently  available  in  the 
literature.  However,  as  should  be  clear  from  earlier  comments, 
such  a  joint  distribution  depends  heavily  on  the  service  time 
distributions  and  is  likely  to  be  in  quite  a  cumbersome  form 
unsuited  for  comparison. 

For  these  reasons,  we  focus  instead  on  a  new  metric  that 
reveals  the  relationship  between  the  tail  behaviors  of  R  and 

D.  More  precisely,  define 

7 

Since  R  <  D,  we  have  0  <  7  <  1  and  it  is  plain  that  7  is 
asymptotically  the  conditional  probability 

P  [R>  x,D  >  x\ 

7  =  lim  — L  - - - -  =  lim  P  [R  >  x\D  >  x\  . 

x^oo  P  [D  >  X\  x^oo 

Thus  7  could  also  be  interpreted  as  the  asymptotic  fraction  of 
customers  that  given  their  end-to-end  delay  exceeds  x,  their 
resequencing  delay  also  exceeds  x  (for  large  x ).  In  general, 
the  smaller  value  7  takes,  the  smaller  impact  the  resequencing 
delay  has  on  the  end-to-end  delay.  In  some  extreme  cases,  for 
example: 

(i)  If  7  =  0,  then  the  tail  of  R  is  much  lighter  than  the 
tail  of  D  -  If  a  customer  experiences  a  long  end-to-end 
delay,  it  is  very  likely  due  to  the  delay  along  the  path, 
not  to  resequencing;  and 

(ii)  If  7  =  1,  then  R  and  D  are  equivalent  in  the  tail  -  Thus, 
the  long  resequencing  delay  is  a  major  contributor  to  the 
long  end-to-end  delay  suffered  by  the  customer. 


P  [R>x] 

lim  —7 - r 

x— >00  P  [D  >  x\ 


The  tail  probability  of  the  service  time  distributions  is 
crucial  in  determining  7.  Hence  we  will  examine  7  under  a 
wide  range  of  service  time  distributions  from  light-tail  distri¬ 
bution,  e.g.,  exponential  distribution,  to  heavy-tail  distribution. 
Whether  these  service  time  distributions  can  be  found  in 
existing  networks  is  a  subject  for  further  study. 

D.  Outline  of  the  paper 

The  paper  is  organized  as  follows:  In  Section  II,  we  describe 
the  model  and  state  some  preliminary  results  from  [8].  Some 
simple  but  useful  asymptotics  that  will  be  repeatedly  used 
are  given  in  Section  III.  Sections  IV  -  IX  contain  the  main 
results  on  the  evaluation  of  7.  In  Section  IV  we  derive  the  tail 
behavior  of  D  under  any  service  distributions.  We  examine 
the  tail  behavior  of  R  for  different  service  time  distributions 
in  Sections  V  and  VII  -  IX.  Section  VI  clarifies  the  definitions 
and  summarizes  the  properties  of  tails  of  distributions,  which 
are  used  in  the  rest  of  the  paper.  The  two  extreme  cases,  i.e., 
exponential  services  vs.  heavy  tailed  services,  are  compared 
in  Section  X  and  Section  XI  concludes  the  paper  with  a  few 
discussions  on  the  future  work. 

A  word  on  the  notation  used  in  this  paper.  Two  R-valucd 
rvs  X  and  Y  are  said  to  be  equal  in  law  if  they  have  the 
same  distribution,  a  fact  we  denote  by  X  =  st  Y.  The  rv  X  is 
stochastically  smaller  the  rv  Y  if  P  [X  >  x]  <  P  [Y  >  x\  for 
all  x  in  R,  a  fact  we  denote  by  X  <st  Y.  We  assume  the  reader 
to  be  familiar  with  the  classes  S  and  £  of  subexponential  and 
long-tailed  probbaility  distributions,  respectively.  Additional 
material  on  these  classes  of  distributions  is  available  in  the 
monograph  [6], 

II.  The  model  and  preliminaries 

The  system  of  interest,  depicted  in  Figure  1,  consists  of 
K  queueing  stations  operating  in  parallel,  followed  by  a 
resequencing  buffer.  Upon  arrival,  each  customer 1  is  routed  to 


Queue  1  H  i 


Resequencing 

buffer 


Fig.  1.  System  model 


one  of  the  K  stations.  When  it  completes  service,  the  customer 
leaves  the  system  immediately  only  if  all  prior  customers  2  have 
already  left  the  system;  otherwise,  the  customer  awaits  in  the 
resequencing  buffer  until  all  prior  outstanding  customers  com¬ 
plete  their  service,  at  which  point  it  leaves  the  resequencing 
buffer.  This  ensures  that  each  customer  leaves  the  system  in  the 

'From  now  on  we  adopt  the  generic  terminology  of  Queueing  Theory  and 
refer  to  packets  as  customers. 

2Prior  customers  refer  to  the  customers  which  have  entered  the  system 
before  that  particular  customer. 


order  in  which  it  arrived,  possibly  experiencing  in  the  process 
an  additional  delay  in  the  resequencing  buffer.  Of  particular 
interest  is  the  end-to-end  delay  experienced  by  customers;  this 
quantity  is  defined  as  the  sum  of  the  customer  sojourn  time 
(i.e.,  the  time  spent  at  the  station  it  joined,  and  composed  of 
the  time  waiting  in  buffer  for  service  and  of  the  time  in  service) 
and  of  the  customer  resequencing  delay  (i.e.,  the  time  spent 
in  the  resequencing  buffer). 

The  discussion  is  carried  out  under  the  following  assump¬ 
tions:  For  each  k  =  1, . . . .  K,  the  kth  queueing  station  is 
modeled  as  an  infinite  capacity  buffer  attended  by  a  single 
server  which  serves  customers  in  the  FCFS  order;  hereafter 
we  refer  to  the  kth  queueing  station  as  queue  k.  Consecutive 
service  time  durations  provided  at  queue  k  are  assumed  to 
be  i.i.d.  M+-valued  rv,  and  let  <j denote  the  correspond¬ 
ing  generic  service  time.  Customers  arrive  into  the  system 
according  to  a  Poisson  process  with  parameter  A,  and  each 
incoming  customer  is  routed  probabilistically  to  queue  k  with 
probability  pk,  k  =  with  pk  >  0  and  Y^k= 1  Pk  =  1- 

Routing  decisions  are  made  independently  across  customers, 
and  can  therefore  be  modeled  as  a  sequence  of  i.i.d.  thinning 
rvs.  Throughout,  we  assume  that  the  Poisson  arrival  process, 
the  routing  process  and  the  service  duration  processes  at  the 
K  queues  are  mutually  independent. 

Under  these  assumptions,  by  standard  properties  concerning 
the  Bernoulli  thinning  of  Poisson  processes,  it  is  plain  that  the 
K  single  server  systems  can  be  interpreted  as  K  independent 
M\G\1  queues  operating  in  parallel.  For  each  k  =  1, . . . ,  K, 
queue  k  is  fed  by  Poisson  arrivals  with  parameter  A pk  and 
generic  service  time  <Jk,  and  its  stability  is  characterized  by 

Pk  ■=  ApfcE  [crfc]  <  1.  (1) 

Under  (1),  there  exist  IK +- valued  rvs  Wk  and  Tk  which  denote 
the  (customer)  stationary  waiting  and  sojourn  times  in  queue 
k. 

The  system  is  stable  if  and  only  if  all  queues  are  stable, 
namely 

Pk  <  1,  k  =  1, . . . ,  K.  (2) 

Then,  under  (2),  there  exist  K+ -valued  rvs  T,  R  and  D  which 
denote  the  (customer)  stationary  sojourn  time,  resequencing 
delay  and  end-to-end  delay,  respectively.  The  next  two  results 
summarize  the  distributional  properties  of  these  rvs,  which  can 
be  found  as  Theorems  3.1  and  3.2  in  [8]  with  slightly  different 
notation:  Let  v  be  an  {1, . . . ,  A'}-valued  rv  distributed  accord¬ 
ing  to 

P  [v  =  k]  =  pk,  k  =  1, . . . ,  K. 

In  what  follows,  the  rvs  Wk  and  ak,  k  =  1, ...  ,K,  and  v  are 
assumed  to  be  mutually  independent  rvs  defined  on  the  same 
probability  space  (Pl,T ,  P),  and 

Tk  =  Wk  +  ak,  k  =  1, . . . ,  K.  (3) 

Strictly  speaking,  we  only  have  Tk  =st  Wk  +  &k  for  each 
k  =  1, . . . ,  K.  However,  we  shall  engage  in  the  common  and 
somewhat  innocuous  practice  of  taking  Wk  +  crk  to  be  the 


customer  stationary  sojourn  time  Tk  in  the  stable  M|G|1  queue 
introduced  earlier. 

Theorem  2.1:  With 


it  holds  that 


and 


=  max  Wk , 

(4) 

T=TV, 

(5) 

max  (w, 

(6) 

/ —  \  + 

[W  -  Tv\  . 

(7) 

The  resequencing  delay  of  a  customer,  if  nonzero,  is  the 
additional  time  it  waits  for  all  prior  outstanding  customers  to 
complete  their  service  at  all  queues  other  than  selected  queue 
v.  As  a  result,  from  (5)  and  (6)  we  get 

R  =  D  —  T  =  max  (w ,  T^j  —  Tv 
and  the  form  of  (7)  follows. 

We  can  now  make  use  of  these  representations  to  get  the 
distributions  of  D  and  R  in  terms  of  those  of  the  rvs  Wk  and 
Tfc,  k  =  1,...,K.  With 


Wk  =  max  W{,  k  =  1, . . . ,  K  (8) 

i^k 

it  is  a  simple  matter  to  conclude  the  following: 

Theorem  2.2:  The  distribution  functions  of  the  rvs  R  and  D 
are  given  by 


K 

P  [D  <  x\  =  max  (wk,T^j 


<  x 


k= 1 


and 


K 


p  [R  <  x]  =  Y,pkP 


Wk  <  x  +  Tk 


fc= i 


(9) 

(10) 


for  all  x  >  0. 

As  pointed  out  in  the  introduction,  we  seek  to  evaluate  the 
ratio 


7  =  lim 

x— >00 


P  [A  >  x] 
P  [D  >  x] ' 


(11) 


Our  strategy  for  doing  so  is  to  identify  the  tail  behavior  of 
the  distributions  of  D  and  R ,  respectively.  The  next  section 
present  some  of  the  basic  tools  to  do  just  that.  This  is  followed 
in  Section  IV  by  a  general  result  of  the  tail  behavior  of  D. 


III.  Some  easy  but  useful  asymptotics 

We  shall  have  repeated  use  for  the  next  elementary  lemma 
where  asymptotic  equivalence  is  defined  as  follows:  For  map¬ 
pings  f,g  :  K+  — >  M,  we  write  f(x)  ~  g(x)  ( x  —>  oo)  if 
lim^-xx,  =  1.  In  what  follows,  without  further  mention, 
all  asymptotics  are  understood  in  the  regime  where  x  is  large, 
and  the  qualifier  x  — >  oo  is  now  dropped  from  the  notation. 


Lemma  3.1:  Consider  a  finite  family  of  functions 
fi,---,fL,9u*--,gL  ■  R+  ->  R-  If  fi(x)  ~  gi{x)  for 
each  £  =  1 ,L,  then  we  have 


~  J2gt(x). 


IV.  The  tail  behavior  of  D 

Taking  (9)  as  the  point  of  departure,  we  first  deal  with  the 
tail  behavior  of  the  rvs  W\ , . . . ,  Wk  ■ 

Lemma  4.1:  For  each  k  =  1, . . . ,  K,  it  holds  that 


i=  l 


e=i 


Now  consider  a  finite  collection  of  R+ -valued  sequences, 
say  {ai>ra,  n=  1,2,...},  ....  { ciL,n ,  n  =  1,2,...}  for  some 
integer  L. 

Lemma  3.2:  If  lim„_>00  «/:  n  =  0  for  each  £  =  1, ...  ,L, 
then  it  holds  that 


Wk  >  x 


Y  p  m  >  xj . 


(17) 


hm  1  n^i  (1  «<,»)  =  L 

n—>oc  n  n 

/  l  a£,n 


(12) 


Proof.  This  is  a  straightforward  consequence  of  Lemma  3.2 
once  we  note  that 


Before  giving  a  proof,  recall  some  elementary  facts  from 
calculus:  As  x  j  0,  we  have 


Wk  >  x 


=  l-Y[P[We< 


i^tk 


and 


log(l  —  x)  =  —x  +  o(x),  0  <  x  <  1 


1  —  ex  =  —x  +  o(x):  x  >  0. 


(13) 


(14) 


1  -  n  - p  >  *]) .  *  >  ° 


under  the  independence  assumptions. 


□ 


Proof.  Fix  n  =  1,2,...  and  write 

L 


Lemma  4.1  readily  leads  to  the  next  asymptotic  equivalence. 
Lemma  4.2:  For  each  k  -  1. ....  K.  it  holds  that 


it, 

Then,  we  note  that 

L 


=  Y log  (-1  ~  «<>)  • 


max 


>  x 


e=i 


( [wk,Tk ) 

P  [Tfc  >  x]  +  Y  p  [Wt  > 

e^k 


(18) 


1  —  11(1  -at,n)  =  l-nel°g(1-^ 

f-i 

=  1  -  eUn 


e=i 


Proof.  Fix  k  =  1, ...  ,K  and  x  >  0.  By  independence  of  Wk 
and  Tfc,  we  find 


-un  +  o(un) 

-«n(  1  +  0(1)) 


(15) 


max 


(wife,  Tfc) 


>  x 


as  we  make  use  of  (14)  since  lim^^oo  un  =  0  under  the 
enforced  assumptions.  On  the  other  hand,  (13)  readily  yields 


=  P  [Tfc  > 


Wk  >  x 


P  [Tfc  < 


=  ~  Y  aZ,n  (1  +  o(l)) 

e=i 

=  -  (H-oW). 


and  the  result  is  immediate  from  Lemma  4.1  as  we  note  that 


Wk  >  x  P  [Tfc  <  x]  ~  P 


Wk  >  x 


□ 


(16) 


Combining  (15)  and  (16)  yields 

L  /  L 


1  -  U  (1  -  ae,n)  =  Y  ae’n  (1  +  °(1)) 


i=\  \e= i 

and  the  desired  conclusion  follows. 


□ 


We  are  now  in  a  position  to  characterize  the  tail  behavior 
of  the  end-to-end  delay  D. 

Proposition  4.3:  It  holds  that 

P  [D  >  x\ 

K 

~  Y/PkP[Tk>x\  +  {l-pk)P[Wk>x\.  (19) 

k=  1 


Proof.  From  (9),  upon  using  Lemma  4.2,  we  get 

K 


p  [D  >  x\  =  E^fcP  max(wk,Tk'j 


fc= 1 
K 


>  X 


E**  \P[Tk>x]  +  Y/P[We> 

k=  1  y  tj^k 

K 

[Tk  >  X ] 


k= 1 


K 


+  Z>  Epi^> 

^=1 

Easy  algebra  leads  to  desired  conclusion  (19)  since 


K 


K 


/c=i 


for  all  x  >  0. 


x  = 


E  5>  p[Wi> 

£=1  \k^£  ) 

K 

E(1  -pt) p  \Wt  >  x\ 


e=i 


Proposition  5.1:  For  exponentially  distributed  services 
(21),  it  holds  that 


P  [D  >  x]  ~  |  E  (Pk  +  (1  -  Pk)Pk)  J  e 
VfceAr  / 


Proof.  Substituting  (22)  and  (23)  into  (19)  leads  to 

K 

P  [D  >  x]  ~  E  (Pfc  +  (!  -  Pfc)Pfc)  e_afca:, 


(26) 


fc=i 


and  the  conclusion  (26)  is  now  straightforward. 


□ 


(20) 


Next,  we  tackle  the  tail  behavior  of  R.  To  state  the  result. 


and 


with 


□ 


1, ...  ,K,  define 

07.  :=  min  (at,  l  ^  k) 

(27) 

n 

?r 

li 

M 

^s> 

(28) 

\ee  Mk  / 

Mk  :=  {(-  Y  k  '■  ae  =  a*k }  • 

(29) 

The  tail  behavior  of  R  is  more  delicate  to  obtain  and 
depends  very  much  on  the  distributional  assumptions  made  on 
the  service  time  durations.  We  explore  this  point  in  turn  for  (i) 
exponential  services  [Section  V],  (ii)  services  with  exponential 
tails  [Section  VII],  (iii)  subexponential  services  [Section  VIII] 
and  (iv)  a  combination  of  service  times  with  exponential  and 
heavy  tails  [Section  IX]. 


Proposition  5.2:  For  exponentially  distributed  services 
(21 ),  it  holds  that 


( 


P  [R  >  x\  ~  (  E  Vk  .  ak  rfc  e-0** 


\kGM* 


a*  +  ak 


where 


M*  :=  {k=  :  cx*k  =  a*}. 


(30) 


(31) 


V.  Exponential  services 

We  begin  with  the  simple  situation  where  the  service  times 
at  each  queue  are  exponentially  distributed,  i.e.,  for  each  k  = 
1, . . . ,  K ,  we  have 

P  [o-fc  <  x\  =  1  -  e~^kX,  x>0  (21) 

for  some  p,k  >  0.  Queue  k  is  an  M\M\1  queue  and  closed- 
form  expressions  are  thus  available  for  the  distributions  of  the 
waiting  time  Wk  and  of  the  sojourn  time  Tk.  We  write 

otk  :=  Pk  ~  A pk  =  Pk(  1  —  Pk) 

where,  as  customary,  we  have  set  pk  =  A2L.  It  is  known  [9, 
pp.  202-203]  that 

P  [Wk  <  x]  =  1  -  pke~akX,  x>0  (22) 

and 

P  [Tk  <  x]  =  1  -  e~akX,  x>0.  (23) 

In  this  case,  we  can  compute  7  directly.  Set 

a *  :=  min  (ai, . . . ,  ax)  (24) 

and  introduce 


(25) 


A  proof  of  Proposition  5.2  is  given  in  Appendix  I.  The 
evaluation  of  7  is  now  within  easy  reach  by  combining 
Propositions  5.1  and  5.2. 

Proposition  5.3:  For  exponentially  distributed  services 
(21 ),  it  holds  that 

_  Fk  cK*-(-afc  I3TI 

7  T,keM(Pk  +  0--Pk)PkY 

The  expression  (32)  takes  a  particularly  simple  form  when 
the  waiting  time  and  sojourn  time  distributions  at  all  the 
queues  have  the  same  decaying  rate;  this  can  be  seen  by  direct 
substitution. 

Corollary  5.4:  For  exponentially  distributed  services  (21) 
with  «!  =  ...  =  ax,  we  have 

1  r 

7 - 21+r 

where 

K 

r  ;=  ^2pk{l-Pk)- 

k- 1 

In  this  special  case,  we  have  7  <  5  for  any  finite  value 
K ,  which  implies  that  a  long  end-to-end  delay  is  more  likely 


M.  :=  {k  =  1, . . . ,  K  :  ak  =  or*}  . 


to  result  from  factors  other  than  resequencing.  However,  it  is 
still  possible  for  7  ~  1  under  certain  conditions.  For  instance, 
when  the  service  rates  of  the  stations  are  drastically  different 
among  themseleves,  the  resequencing  delays  of  mis-ordered 
customers  become  a  large  portion  of  their  end-to-end  delays. 
We  illustrate  this  point  with  the  following  numerical  example 
with  three  stations  in  parallel,  i.e.,  K  =  3.  The  customer  arrival 
rate  is  A  =  5,  and  the  service  rates  are  // 1  =  0.3,  P2  =  8 
and  /is  =  12,  respectively.  Stability  is  ensured  by  taking  the 
routing  probabilities  to  be  p\  =  0.05,  P2  =  0.35  and  p^  =  0.6. 
Direct  calculations  based  on  (32)  yield  7  =  0.9346.  Station  1 
has  the  slowest  service  rate,  which  limits  the  amount  of  work 
that  can  be  routed  to  it.  Thus,  a  large  number  of  customers 
served  by  stations  2  and  3  are  very  likely  to  spend  a  long 
time  in  the  resequencing  buffer  waiting  for  a  small  number  of 
customers  arrived  earlier  to  finish  their  service  from  station  1. 

VI.  Tails  of  distributions 


may  not  exist,  as  can  be  seen  through  the  example 


P  [X  <  x]  =  1  - 


I\  +  cos  x 

- e 

K+l 


x  >  0 


with  K  >  2. 3  Obviously  0*  =  1  here  but 


P  [X  >  x\  K  +  cos  x 
e~6*x  ~  K  +  l 


and  the  limit  does  not  exist  owing  to  the  oscillatory  nature  of 
the  cosine  function. 

Yet,  as  we  shall  see  shortly,  there  are  natural  circumstances 
where  the  limit  (38)  exists,  i.e.,  there  exist  constants  a  >  0 
and  C  >  0  such  that 


P  [X  >  a]  ~  Ce~ax.  (39) 

In  that  case,  it  is  easy  to  check  from  (36)  that  a  =  6*  with 
E  [ee'Y]  <  00  (resp.  E  [eex]  =  00)  whenever  9  <  a  (resp. 
a  <  9). 


In  this  section  we  have  collected  some  of  the  basic  defini¬ 
tions  and  properties  pertaining  to  tails  of  distributions  which 
we  shall  use  in  the  remainder  of  this  paper. 


A.  Exponential  tails 

Definition  6.1:  The  R+-vaIued  rv  X  has  an  exponentially 
decaying)  tail,  denoted  X  £  £,  if 

9 *  :=  sup  (0  <E  R  :  E  [eex]  <  00)  >  0.  (33) 

Note  that 

E  [ee'Y]  =00,  0*  <  9. 

Markov’s  inequality  (applied  to  eex  with  9  >  0)  yields 

P  [X  >  x\  <  e~0xE  [eeY]  ,  x  >  0  (34) 


whenever  0  <  9  <  0*,  and  the  rv  X  satisfying  (33)  indeed 
displays  an  exponentially  decaying  tail. 

A  little  more  can  be  said  under  (33):  It  is  plain  from  (34) 
that 

lim  P  ^  =0,  0  <  0  <  0*  (35) 

x — ►oo  e 

On  the  other  hand,  for  each  0  >  0,  we  have 


E  [e9-Y]  =  1  +  E 

=  1  +  E 


9eexdx 


/>00 

/  1  [X  >  x]  9e6xdx 

Jo 


=  1+  9etixP  [X  >  x]  dx 


(36) 


Jo 

by  Tonelli’s  Theorem.  With  the  help  of  this  last  relationship, 
it  is  then  not  too  difficult  to  show  that 


lim 

X — ► OO 


P[X>  X ] 
e-6x 


=  OO, 


0*  <  0. 


(37) 


B.  Heavy  tails 

In  order  to  capture  the  notion  of  heavy  tailed  distributions, 
we  shall  rely  on  the  following  definitions. 

Definition  6.2:  The  R+ -valued  rv  X  has  a  long  tail,  denoted 
X  £  £,  if  P  [X  >  x\  >  0  all  x  >  0  and 


lim 

x—>oo  P  [X  >  x] 


=  1, 


y  e  R. 


(40) 


Definition  6.3:  The  K+ -valued  rv  X  has  a  subexponential 
tail,  denoted  X  £  S,  ifP  \X  >  x\  >  0  all  x  >  0  and 


P[X  +  Y  >  x] 
hm  — — — — - - — 

x—>oo  P  [X  >  x] 


=  2 


(41) 


where  Y  is  an  independent  copy  of  X. 

The  inclusion  S  C  C  is  known  to  hold  [6,  p.  50]  (and 
references  therein).  Moreover,  S  contains  many  well-known 
distributions  such  as  Weibull,  log-normal,  Pareto  and  regularly 
varying  distributions.  Some  useful  facts  concerning  the  classes 
£  and  S  are  presented  below; 

Proposition  6.4:  Assume  X  £  C. 

(i):  For  every  0  >  0,  it  holds  that 


lim  e^’P  [X  >  x]  =  00 

x — >-oo 


whence  E  [eex]  =  00; 

(ii):  If  0  <  E  [A"]  <  00,  then 


P  [X  >  x ]  =  o(P  [X*  >  *]) 


where  X*  denotes  the  forward  recurrence  time  associated  with 
X;  it  is  the  rv  with  integrated  tail  distribution  given  by 

/»oo 

P  [A*  >  x]  :=  E[A]_1  /  P[X  >t]dt,  x  >  0;  (42) 

J  X 

(iii):ForanyR+-valuedrvY  which  is  independent  of  X ,  the 
equivalence 


In  general,  the  limit 


P  [A  >  x  +  Y\  ~  P  [A  >  a;] 


lim 

x — >-oo 


P[X>  x ] 

e-9'x 


'This  will  ensure  that  the  right  handside  is  indeed  increasing  and  therefore 
a  bona  fide  distribution. 


while 


holds. 

A  proof  is  available  in  Appendix  II.  The  following  fact  is 
a  simple  consequence  of  (  35)  and  of  Claim  (i)  of  Proposition 
6.4. 

Corollary  6.5:  For  R+ -valued rvs  X  and  Y ,  it  holds  that 
P[X  >  x]  =  o(P  [Y  >  x])  (43) 

whenever  X  £  £  and  Y  £  C. 


VII.  The  exponential  tail  case 

We  generalize  the  setup  of  Section  VII  to  service  time 
distributions  with  exponentially  decaying  tails:  For  each  k  = 
1, . . . ,  K,  we  assume  ok  £  £,  namely 

9k  :=  sup  (fleR:  E  [e8ak]  <  oo)  >  0.  (44) 

At  this  level  of  generality,  closed  form  expressions  such  as 
(22)  and  (23)  are  usually  not  available  for  the  rvs  IT/,  and  Tk. 
However,  the  tail  behavior  of  Wk  can  be  characterized  through 
the  classic  Cramer-Lundberg  approximation  [1,  Example  5.5, 
p.  366].  For  easy  reference,  we  briefly  discuss  this  approxi¬ 
mation  in  the  context  of  M\G\1  queues  considered  here:  The 
Lundberg  equation  takes  here  the  special  form 

E  [e6**]  =  1  +  (Xpk)~19,  9  >  0  (45) 

and  under  the  stability  condition  (1)  it  has  a  unique  solution 
which  we  denote  bycifc.  It  is  plain  that  oik  <  As  we  are  in 
the  non-lattice  case  for  the  underlying  random  walk,  it  follows 
[1,  Thm.  5.3,  p.  365]  that 

P  [Wk  >x}~  Cke~akX  (46) 

for  some  constant  Ck  given  by 


Ck  := 


1  ~  Pk 

ApfeE  [ okeakC7k }  -  1 


(47) 


with  pk  given  by  (1).  Details  are  available  in  [1,  p.  367], 

The  Cramer-Lundberg  approximation  leaves  open  the  ques¬ 
tion  of  the  tail  behavior  of  the  sojourn  time  Tk.  Given  that 
Wk  <st  Tk,  it  is  plain  from  (46)  that  if  the  asymptotics 


P  [Tk  >x]~  Dke~0kX  (48) 


were  to  hold  with  constants  Dk  >  0  and  /3k  >  0,  then 
necessarily  (3k  <  oik ■  In  the  case  of  exponentially  distributed 
services,  we  do  in  fact  have  (48)  with  /3k  =  oik-  However, 
the  asymptotics  (48)  appear  not  to  hold  in  the  same  level  of 
generality  as  the  Cramer-Lundberg  approximation  (46).  Still, 
for  our  purpose,  some  pertinent  information  can  be  deduced 
from  (46). 

As  already  pointed  out  in  Section  VI,  under  (46),  we  have 
E  [e61^]  <  oo  (resp.  E  [eeWfc]  =  oo)  if  9  <  ak  (resp.  oik  < 
9).  The  independence  of  the  rvs  W k  and  yields 

E  [e8Tk]  =  E  [eeWk]  •  E  [e8ak]  ,  9  €  R  (49) 

and  from  earlier  remarks  we  conclude  that 


E  [e8Tk]  =  oo,  ak  <  9.  (51) 

Using  Markov’s  inequality  again  we  get 

P  [Tk  >  x]  <  e~8xE  [e8Tk]  ,  x>0  (52) 

whenever  0  <  9  <  ak-  In  short,  under  (44),  we  have 

Wk  £  £  and  Tk  £  £.  (53) 

These  observations  can  now  be  put  to  work  towards  charac¬ 
terizing  the  tail  behavior  of  D  and  R. 

Proposition  7.1:  If  (44)  holds  for  each  k  =  1. ....  K ,  then 

P  [D  >  x]  ~  Y  pfcP  [Tk  >  x] 

k£M 

+  (E(1  -Pk)Ck]e-a*x  (54) 
\keM  / 

with  a*  and  M  defined  at  (24)  and  (25),  respectively. 

Proof.  In  view  of  the  observations  made  earlier,  it  is  clear 
that 

P  [Tk  >  x]  =  o(e~a*x),  k(£M  (55) 

since  a*  <  ak  for  k  not  in  A4.  Combining  the  Cramer- 
Lundberg  approximation  (46)  with  the  asymptotics  (19)  now 
leads  to 

P  [D  >  x] 

K  K 

-  YJPkP[Tk>x\  +  YJ0--Pk)Cke-akX 

fc= 1  k—% 

and  (54)  follows  as  we  make  use  of  (55).  □ 


Under  the  enforced  assumptions,  there  is  no  guarantee  a 
priori  that  the  limits 


lim 

x—*oo 


P  [Tk  >  x] 


k  £  M 


exist. 

We  now  turn  to  the  tail  behavior  of  R:  For  each  k  = 
1  define  a*k  and  Aik  as  before  by  (27)  and  (29), 

respectively,  and  set 


r fc  :=  Y,  ct-  (56) 

t.&Mk 


This  is  the  analog  of  (28)  for  service  time  distributions  with 
exponentially  decaying  tails.  A  proof  of  the  next  result  is 
available  in  Appendix  III. 

Proposition  7.2:  If  (44)  holds  for  each  k  =  1, . . . ,  K,  then 


E  [e8Tk]  <  oo 


9  <  min  (6k,  ak)  =  ak 


(50)  with  Ai*  given  by  (31). 


VIII.  The  subexponential  case 
In  this  section,  we  consider  the  situation  where  the  service 
times  at  all  the  queues  are  “heavy  tailed”  in  the  following 
technical  sense:  For  each  k  =  1, ...  ,K, 

cjfc  €  C  and  er^  €  S  (58) 

where  denotes  the  forward  recurrence  time  (42)  associated 
with  the  service  time  rv  a .  Sufficient  conditions  on  07-  to 
ensure  that  <rk  is  subexponential  can  be  found  in  [6,  p.  52], 
The  tail  behavior  of  the  rvs  Wk  and  7’/i;  is  summarized  here 
for  future  reference. 

Proposition  8.1:  Under  the  asumptions  (58),  it  holds  that 

P  [Wk  >  x]  ~  — — P  [at  >  x]  With  Wk  G  5  (59) 

1  -  Pk 

and 

P  [Tfc  >  x]  ~  P  [Wk  >  x }  with  Tk  G  5.  (60) 


Proof.  By  interpreting  the  Pollaczeck-Khintchine  formula  in 
stable  M\GI\1  queues,  we  can  express  the  stationary  waiting 
time  as  a  geometric  sum  of  i.i.d.  forward  recurrence  time 
associated  with  the  service  time  [1,  p.  296]  [9,  p.  201].  More 
precisely,  let  Vk  be  a  geometric  rv  with  parameter  pk,  i.e., 

P  [vk  =  n]  =  (1  -  pk)Pk,  n  =  0,1,... 

and  let  {ak  n,  n  =  1,2,...}  denote  a  collection  of  i.i.d.  rvs 
distributed  according  to  cr£.  Then,  it  holds 

fk 

Wk=stJ2<n  (61) 

n= 1 

where  the  rv  ip;  is  taken  to  be  independent  of  the  sequence 
{(j*k  n,  n  =  1,  2, . . Then,  (59)  follows  by  the  Random  Sum 
Theorem  for  subexponential  rvs  [1,  Lemma  9.2.,  p.  296]  [6, 
Thm.  A3. 20,  p.  580], 

To  obtain  the  tail  behavior  of  Tk,  observe  from  Claim  (ii) 
of  Lemma  6.4  that  G  C  implies 

P  [(Tk  >  X\=  0(P  [CTfc  >  X]), 

and  the  equivalence  P  [Tk  >  x\  ~  P  \Wk  >  x]  follows  from 
(59)  upon  making  use  of  Lemma  A3.23  in  [6,  p.  582], 
The  statement  Tk  G  S  follows  by  tail  equivalence  from  the 
statement  Wk  G  S  [6,  Lemma  A3. 15,  p.  572],  □ 


The  behavior  of  R  is  examined  next,  with  a  proof  given  in 
Appendix  IV. 

Proposition  8.3:  If  (58)  holds  for  each  k  =  1, ....  K ,  then 
we  have 

K 

P[R>x]~y(l-pk)P[Wk>x}.  (64) 

fc= i 


The  asymptotics  (62)  and  (64)  already  suggest 


7  =  lim 

x— >oo 


Ek=i(i-Pk)P[Wk>x} 

Ek=i  p  m  >  x] 


(65) 


provided  the  limit  exists.  This  observation  can  be  exploited  as 
follows  when  uniform  routing  is  used. 

Corollary  8.4:  If  (58)  holds  for  each  k  =  1 , ...  ,K,  with 
pi  —  ...  =  pk  =  -£?,  then  the  limit  (65)  exists  with 


7  = 


K  -  1 
K 


(66) 


This  result  is  independent  of  the  specific  form  of  the  service 
times  (ii, ,  <tk  under  condition  (58).  Moreover,  7  will 
become  increasignly  close  to  1  as  K  becomes  large. 

Under  non-uniform  routing,  the  existence  of  the  limit  at 
(65)  is  not  automatically  guaranteed.  We  explore  the  issue 
by  strengthening  the  underlying  assumptions  used  thus  far. 
Specifically,  in  addition  to  the  assumptions  (58)  holding  for 
each  k  =  1, . . . ,  K,  there  exists  an  R+-valued  rv  Z  G  S  such 
that 

P  [ <rl  >  x\  ~  cfcP  [Z  >  x]  (67) 

for  some  constant  Ck  >  0.  The  case  Ck  >  0  coresponds  to 
the  rvs  <Jk  and  Z  being  tail  equivalent,  and  (67)  does  imply 
<rj£  G  S.  On  the  other  hand,  Ck  =  0  represents  the  situation 
where  the  rv  <r£  has  a  weaker  tail  than  Z.4  In  either  case, 
under  (58)  the  equivalence  (59)  yields 

P  [Wk  >  x]  ~  dkP  [Z  >  x]  with  dk  =  Ck — — —  (68) 

1  -  Pk 

and  (65)  easily  leads  to  the  following  asymptotics. 

Proposition  8.5:  If  (58)  holds  for  each  k  =  1 ,...  ,K  with 
(67),  then  the  limit  (65)  exists  with 


Under  the  asumptions  (58),  reporting  the  asymptotic  equiv¬ 
alence  (60)  into  Proposition  4.3  yields 


K 

P  [D  >  x]  ~  P  ^Wk  >  *]  (62)  Provided 

fc= 1 


and  the  asymptotic  equivalence  (59)  leads  to  the  following 
Proposition  8.2:  If  (58)  holds  for  each  k  =  1, ....  K ,  then 


EfcLiXl  -Pk)dk 
Efc= 1 


7  = 


K 

dk  >  0. 

k—l 


(69) 


p  [d  >  x]  ~  y  — ^ — — p  \<Jk  >  x\ . 

hl~pk 


(63)  4The  property  cr*  G  is  not  necessarily  implied,  and  therefore  needs  to 
be  assumed. 


IX.  The  general  case 

Assume  now  that  the  set  of  K  queues  is  partitioned  into 
two  non-empty  subsets,  say 

{1,...,K}  =  SUH  (70) 

with  8  and  77  containing  the  queues  with  exponentially  decay¬ 
ing  services  times  and  heavy-tailed  service  times,  respectively. 
More  precisely,  we  assume  that  (44)  holds  for  each  k  in  8, 
while  (58)  holds  for  k  in  77.  In  other  words. 

Ok  :=  sup  (0  £  R  :  E  [ee<Tfc]  <  oo)  >  0,  k  £  8  (71) 

and 

cife  €  C  and  a*k  £  S,  k  £  77 .  (72) 

To  eliminate  situations  previously  encountered  in  earlier  sec¬ 
tions,  we  assume  that  both  8  and  77  are  not  empty,  i.e.,  \8\  >  1 
and  77  >  1.  Although  the  symbol  8  has  been  given  two 
different  meanings,  this  will  not  create  any  confusion. 

Under  these  assumptions,  it  is  plain  from  Corollary  6.5  that 

P[We>  x\  =  o(P[Wk>  x})7  l£81k£H  (73) 

and 

P  [Tt  >  x]  =  o(P  [Wk  >  »]),  £  £  8,  ken  (74) 

as  we  recall  that  We  £  8  and  Te  £  8  while  Wk  £  S.  Reporting 
these  observations  into  (19)  gives 

P  [D  >  x]  ~  ^  P  [Wk  >  x]  (75) 

ken 

upon  using  (60).  Finally,  the  explicit  asymptotics  (59)  leads  to 
Proposition  9.1:  If  (71)  and  (72)  hold  under  the  decompo¬ 
sition  (70),  then 

P  [D>x}~J2  K  >  *]  ■  (76) 

ken  1  pk 


The  behavior  of  R  is  examined  next,  with  a  proof  given  in 
Appendix  V. 

Proposition  9.2:  If  (71)  and  (72)  hold  under  the  decompo¬ 
sition  (70),  then 

P  [7?  >  x]  ~  Y,  (1  -  Pk) P  lWk  >  x]  .  (77) 

ken 


This  time,  the  asymptotics  (75)  and  (77)  suggest 
_  i-  Yjken^  ~  Pk) P  [Wk  >  x \ 

7  ™  EkenP[Wk>x\ 


(78) 


provided  the  limit  exists.  This  constitutes  a  natural  generaliza¬ 
tion  of  (65). 

A  situation  where  the  limit  (78)  exists  is  presented  next;  it 
parallels  the  uniform  routing  case  discussed  in  Corollary  8.4: 
Assume  that 


P  ■=  Pk  = 


^  ^ ke£  Pk 

\n\  ' 


k  £  77. 


(79) 


Under  this  assumption,  (75)  and  (77)  lead  to 
P  [7?  >  x]  ~  (1  -  p)  P  [D  >  x \ 

with 

P  [D  >  x]  ~  J2  P  ^Wk  >  *]  • 
ken 

Corollary  9.3:  Assume  that  (71)  and  (72)  hold  under  the 
decomposition  (70).  With  (79),  the  limit  at  (78)  exists  and  is 
given  by 

7=1  ~P  (80) 


This  result  is  independent  of  the  specific  form  of  the  service 
times  <7i , . . . ,  <jk  under  conditions  (71)  and  (72).  Moreover, 
Corollary  9.3  automatically  applies  when  |77|  =  1,  in  which 
case  7  =  1  —  pi  under  the  convention  77  =  {1}. 

When  (79)  fails  to  hold,  then  the  existence  of  the  limit 
at  (78)  is  not  automatically  guaranteed.  Here,  as  we  did  in 
the  previous  sections,  we  need  to  strengthen  the  underlying 
assumptions.  Thus,  we  complement  (72)  by  requiring  the 
existence  of  an  R+-valued  rv  Z  £  S  such  that  for  each  k 
in  77, 

P[<T*k  >x]  ~CkP[Z>X]  (81) 


for  some  constant  Ck  >  0.  We  close  with  the  following  analog 
of  Proposition  8.5. 

Proposition  9.4:  Assume  that  (71)  and  (72)  hold  under  the 
decomposition  (70).  If  (81)  holds,  then  the  limit  (78)  exists  with 


X)fcg7r(l  Pk)dk 
Yhken  dk 


(82) 


provided 

^2  dk  >  0. 
ken 


X.  Discussion 


To  get  a  better  sense  of  the  impact  of  the  service  distri¬ 
butions  on  the  value  of  7,  we  compare  the  results  in  two 
extreme  situations,  namely  the  one  where  the  service  times 
are  all  exponentially  distributed  and  the  one  where  they  are 
all  heavy-tailed. 

As  seen  earlier,  when  the  workload  is  evenly  distributed 
with  pi  =  ...  =  pk  =  jt,  then  7  <  ^  in  the  exponential  case 
with  pi  =  ...  =  px,  while  7  still  goes  to  1  as  K  increases 
for  the  heavy-tailed  case  regardless  of  the  exact  distributions 
of  the  service  time. 

Now,  in  order  to  go  beyond  this  uniform  allocation,  we 
consider  the  following  situation.  The  number  I\  of  stations, 
the  customer  arrival  rate  A  and  the  load  allocation  vector  p  = 
(pi , . . .  ,pk )  are  given  and  will  be  the  same  in  both  cases. 
In  the  heavy-tailed  case,  we  take  the  service  times  to  have  a 
Pareto  distribution,  e.g.,  for  each  k  =  1, . . . ,  I\ ,  we  take 


P  [crfe  >  x] 


Bk 


Pk 


Bk  +  x 


x  >  0 


with  Bk  >  0  and  Pk  >  1.  In  that  case,  E  [07]  is  finite  and 
given  by 

It  is  well  known  that  <Jk  and  <r£  are  both  in  S  [6]. 

We  shall  take  pi  =  ...  =  3k  =  P  to  simplify  matters 
somewhat.  In  order  to  make  the  two  situations  comparable,  we 
require  that  the  expected  service  times  at  each  station  coincide, 
i.e., 

A— =  E  [tTfc]  =  — ,  k=l,...,K. 

P  pk 

Thus, 

P  —  1 

Bk  =  - - ,  k  =  l,...,K. 

Pk 

Let  1’f  (p)  and  l's(p)  denote  the  value  of  7  in  the  exponen¬ 
tial  and  Pareto  cases,  respectively,  under  the  load  allocation  p. 
These  quantities  were  evaluated  in  a  large  number  of  scenarios 
(some  of  which  are  reported  below).  The  findings  can  be 
summarized  as  follows: 

(i)  In  all  situations  considered,  we  had  I ’ £ (p)  <  I’,s(p).  In 
other  words,  everything  else  being  equal,  resequencing  is  felt 
more  strongly  in  the  heavy  tail  case; 

(ii)  When  pi  =  . . .  =  /j,k  (in  which  case  B\  =  . . .  =  Bk), 
we  found 

r sip1)  <  r s(p)  (83) 

whenever  p  is  “more  balanced”  than  p1 .  A  formal  way  to 
express  this  situation  is  to  say  that  p  is  majorized  by  p',  written 
p  <  p'  [11].  This  is  equivalent  to 

k  k 

fc  =  (84) 


=  (85) 
1  £=  1 

with  p(1)  <  p{2)  <  ...  <  p{K)  and  p'  <  p'{2)  <  ...  <  p'(K) 

denoting  the  components  of  p  and  p'  arranged  in  increasing 
order,  respectively.  The  comparison  (83)  is  certainly  in  line 
with  intuition  since  as  p  gets  more  skewed,  more  customers 
are  routed  to  a  smaller  set  of  queues  and  these  customers  will 
not  be  mis-ordered;  and 

(iii)  The  property  (83)  does  not  hold  in  the  exponential  case. 
In  many  cases,  T £  (p)  appears  to  be  less  sensitive  to  changes 
in  load  allocation  compared  to  V  s(p). 

We  illustrate  these  findings  through  the  following  numerical 
example  displayed  in  Figure  2.  The  parameters  used  are 
K  =  3,  A  =  5,  P  =  3  and  pi  =  p2  =  P3  =  4. 
Calculations  were  performed  for  the  four  load  allocations 
p4  =  (0.33,0.33,0.34),  p2  =  (0.25,0.25,0.5),  p3  = 
(0.1,0.35,0.55)  and  p4  =  (0.05,0.35,0.6).  The  data  points 
in  cross  and  circle  are  for  I'f  ip)  and  I\s’(p),  respectively. 
Note  that  p  ,  A  p2  -<  p  ,  A  p4,  and  that  the  corresponding 

5Obviously,  here  the  condition  (85)  automatically  holds  since  the  vectors 
of  interest  are  load  allocation  vectors. 


data  points  have  been  arranged  in  that  order  on  the  x-axis  at 
x  =  1,  x  =  2,  x  =  3  and  x  =  4,  repectively.  As  shown  in  the 
figure,  T5(p)  is  indeed  monotone  decreasing. 


Comparison  of  y  with  identital  service  rates 


Fig.  2.  Comparison  of  7  with  equal  service  rates 


XI.  Conclusions  and  future  work 

We  have  studied  the  impact  of  packet  mis -ordering  due 
to  multipath  routing  on  their  resequencing  delay  using  a 
simple  queueing  model.  A  new  metric  7  was  introduced  as 
the  asymptotic  ratio  between  the  tail  probability  of  the  rese¬ 
quencing  delay  and  that  of  the  end-to-end  delay.  We  evaluate 
this  parameter  under  different  service  time  distributions  (and 
hence  different  delay  statistics  along  the  multiple  paths).  By 
examining  7,  we  are  now  in  a  position  to  better  understand 
whether  the  long  end-to-end  delay  experienced  by  a  customer 
is  due  to  its  resequencing  delay. 

A  few  words  on  the  future  work.  Obviously,  we  can 
minimize  7  by  simply  sending  all  of  the  traffic  to  one  of 
the  stations.  Thus,  minimizing  7  alone  is  somewhat  coun¬ 
terproductive  as  this  would  of  course  negate  the  potential 
benefits  of  path  diversity.  On  the  other  hand,  it  is  proved  in 
[8]  that  when  the  service  time  distributions  are  identical  for 
all  queues,  the  uniform  load  allocation  minimizes  the  sojourn 
time  in  the  stochastic  order  and  the  end-to-end  delay  in  the 
convex  increasing  order.  The  remaining  question  is  how  to 
select  a  load  allocation  vector  so  that  the  expected  end-to- 
end  delay  is  minimized  while  keeping  7  below  some  given 
threshold.  Admittedly  our  choice  of  service  time  distributions 
with  various  tail  behavior  was  driven  mainly  by  mathematical 
convenience.  However,  validating  this  tail  behavior  (of  both 
service  times  and  delays)  in  current  networks  is  an  interesting 
direction  for  future  work. 
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Appendix  I 

A  proof  of  Proposition  5.2 

Fix  k  =  1 , ,K.  Lemma  4.1  yields 

Wk>x\~J2  Pee~atx  ~  rfc  •  e~<x  (86) 
e^k 

where  a*k  and  are  given  by  (27)  and  (28),  respectively. 

Fix  x  >  0.  The  independence  of  the  rvs  Wk  and  7’/,;  yields 


/  o 


P  |  Wk  >  x  +  Tk  |  =  ak 

which  leads  to 

e“*xP  I  Wk>x  +  Tk 


Wk  >  x  +  t 


~akt  dt , 


=  «fc 


Wk  >  x  +  t 


e -a£(x+i) 


i  .  e dt  (87) 


From  (86),  for  every  £  >  0,  there  exists  xk  =  xk(e)  >  0 
such  that 


rfc(i  -e)  < 


wk  >  x 


<  Tfe(l  +  e) 


whenever  x  >  xk.  Therefore,  on  that  range,  it  is  also  the  case 
that  the  bounds 

Oik 


eak*p 


Wk  >  x  +  Tk 


< 


Oi*k  +  ak 


and 


Olk 


oTk  +  ak 

hold  since 


rfc(l-e)  <  e“*xP 


Wk  >  x  +  Tk 


/»oo 

/  (at  +  ak)e~{ak+ak)t  =  1. 

Jo 


It  is  now  plain  that 


Wk  >  x  +  Tk 


Oik 


at  +  ak 


rfce-“*a 


(88) 


given  that  e  >  0  is  arbitrary. 

Next,  returning  to  (10),  we  conclude  that 


K 


P  [i?  >  a’]  ~  ^ 


oik  Ti 

Pk  +  ,  k  fcG 


^  a*  +  ak 

I  \  '  ak 

Pk 


ti6M. 


at  +  ak 


Tk  e~a*x  (89) 


where  we  have  set 


and 


a*  =  min  (a*, . . . ,  a*K)  (90) 


M*:={k=l,...,K  :  a*k  =  a*}  .  (91) 


We  reconcile  (89)  with  the  desired  result  (30)  upon  noting  that 
a*  =  a*  so  that  Ad*  =  M*. 


Appendix  II 

A  proof  of  Proposition  6.4 


Claim  (i)  is  Claim  (b)  of  Lemma  1.3.5  [6,  p.  41].  Claim 
(ii)  is  a  simple  consequence  of  the  membership  X  £  C  upon 
taking  x  going  to  infinity  in  the  relation 


P  [V*  >  x\ 
P  [X  >  x] 


P[X>  x  +  t] 
P[X>  a-] 


dt 


valid  for  all  x  >  0.  Details  are  standard  and  left  to  the 
interested  reader. 

Finally,  set 


P(x)  :=  P  [X  >  x] ,  x  >  0. 


By  the  independence  of  the  rvs  X  and  Y,  we  have 


P  [X  >  *  +  Y]  _  \P(x  +  Y) 
P  [X  >x]  ~  [  P(x) 


x  >  0. 


(92) 


Now,  let  x  go  to  infinity  in  this  last  relation,  and  observe 
that  x  — >  P(x)  is  non-increasing  over  R  +  .  The  Bounded 
Convergence  Theorem  yields 


,im  PI*:*4?'1  =  E  f  lim  +L±X1 

X—>00  P  [A'  >  X\  [x— >oo  P(X) 


□ 


rfe(l  +  e) 


since  X  £  C,  and  the  desired  conclusion  follows. 


Appendix  III 

A  proof  of  Proposition  7.2 

The  proof  is  similar  to  that  of  Proposition  5.2  given  in 
Appendix  I:  Fix  k  =  1, . . . ,  K.  Using  the  Cramer-Lundberg 
approximation  (46)  together  with  Lemma  4.1,  we  get 

Wk>x\~J2Cce~a‘X-  (93> 

t^k 

Therefore,  with  ak,  Aik  and  I ’ k  defined  by  (27),  (29)  and 
(56),  respectively,  we  get 


for  each  t  >  0,  and  Wk  €  C  as  well.  In  Wk  £  £  as  well.  In 
other  words. 


P 

lim  — 

x—*oo  p 


Wk  >  x  +  t 


Wk>  x 


d-  =  1,  t  >  0 


(99) 


with  the  limit  taking  place  from  below.  The  rvs  Wk  and  Tk 
being  independent,  the  equivalence 


Wk  >  x  +  Tk 


Wk>  x 


(100) 


Wk  >  x 


Tfce 


(94) 


Thus,  for  each  e  >  0,  there  exists  x*k  =  x*k{e)  >  0  such 
that 


follows  from  Claim  (iii)  of  Proposition  6.4. 

Returning  to  (10),  we  conclude  from  (98)  and  (100)  that 


K 


rfc  (1  — e)  < 


Wk  >  X 


e  * 


<  Tfe  (1  +  s) 


(95) 


P  [R  >  x] 


E^p 


whenever  x  >  x\.  Therefore,  on  that  range,  upon  making  use 
of  the  independence  of  the  rvs  Wk  and  7):,  we  of  the  rvs  Wk 
and  Tk,  we  readily  get  the  bounds 

o~aUx+Tk)] 


and 


Wk  >  x  +  Tk 


Wk  >  x  +  Tk 


>  Tk  (1  -  e)  E 
=  Tk  (1  -  e)  E 

<  rfc(l  +  e)E 


=  rfc(l  +  e)E 

With  e  >  0  arbitrary,  it  follows  that 


e~akTk 


,-ak(x+Tk) 


[Wk  >  *] 

fc= l 

j^Pk  (Ep[^> 

fc= i 

E  E^  p  ^Wt  >  ^ 

i-i  \k^e  I 


and  (64)  follows. 


□ 


o~akTk 


p 

1 YY1  _ 

Wk  >  X 

-  - 

p-aUx  +  Tk) 

(96) 

P 

Wk  >  x 

11111  - 

x—*oo 

e~alx 

—  l  k  -*-j 

This  constitutes  the  appropriate  generalization  of  (88). 
Reporting  this  fact  into  (10),  we  find 

K 


Appendix  V 

A  proof  of  Proposition  9.2 
Fix  k  =  1, ...  ,K  and  recall  from  Lemma  4.1  that 

E  P  [Wt  >  x] . 

tjtk 

For  k  in  £.  it  is  plain  from  (73)  and  (101)  that 


fc= l 


Wk  >  x 


E  P[We>x}. 


(101) 


(102) 


p  [R  >  x]  ~  y^pfcrfcE 
:1 

E  PaA-E 


ten 


(97) 


vfceAt* 


with  ct*  and  A4+  as  defined  at  (90)  and  (91),  respectively. 
The  desired  result  (57)  follows  from  (97)  upon  noting  again 
that  a*  =  a*  and  Ad*  =  AA*.  □ 


Appendix  IV 

A  proof  of  Proposition  8.3 
Fix  k  =  1, ...  ,K.  Lemma  4.1  yields 

Wk  >  x\  ~  e  P  (98) 

tjtk 

where  for  each  £  =  1, ...  ,K,  Wt  £  S ,  hence  Wt  £  C,  by 
Proposition  8.1.  Consequently, 

P  [We  >  x  +  t]  ~  P  [Wt  >  x] ,  l=\  ,...,K 


But  for  each  £  in  H,  Wt  £  5  by  Proposition  8.1,  whence 
Wt  £  C.  Thus,  the  rvs  Wt  and  7).  being  independent,  we  find 

P  [Wt  >  x  +  Tk]  ~  P  [We  >  x] ,  £  £  U 

by  Claim  (iii)  of  Proposition  6.4,  and  it  is  now  a  simple  matter 
to  check  that 


Wk  >  x  +  Tk 


p  lwt  >  x] ,  k  £  £.  (103) 

ecu. 


For  k  in  77,  the  situation  is  somewhat  more  involved:  If 
|77|  =  1,  write  77  =  {1}  and  £  =  {2,..., A'}  for  sake  of 
definiteness.  Thus,  k  in  77  means  k  =  1,  and  (101)  becomes 


K 


Wx>x 


E  p  [Wt>  x] 


(104) 


1-2 


with 


P  [Wt  >x]=  o(P  [Wi  >  *]),  £  =  2,...,K 


(105) 


by  virtue  of  (73).  This  yields 

P  \Wi  >  x\  =o(P[W1  >x]), 

whence 

p\Wx>x  +  tA  =  o(P  [Wi  >  x\)  (106) 

by  a  simple  bounding  argument.  On  the  other  hand,  (103) 
specializes  here  to 

P  \wk  >  x  +  Tk]  ~  P  [Wi  >  x\ ,  k  =  2, . . .  ,K.  (107) 
Combining  (106)  and  (107)  leads  to 

K 

p  [r  >  x]  ~  y^pfcp  [Wi  >  x] 

k—2 

~  (l-pi)P[Wi>*]  (108) 

and  (77)  indeed  holds. 

Assume  now  that  \H\  >  1.  For  each  k  in  Ti,  we  write  Hk 
to  denote  the  subset  of  Ti  obtained  by  deleting  k  from  it.  The 
arguments,  based  on  (73)  and  (101),  and  which  lead  to  (102), 
can  also  be  used  to  conclude 

P  \\Vk  >x\~  ^  P  [We>x\.  (109) 

eenk 

Again,  Claim  (iii)  of  Proposition  6.4  can  be  used  to  validate 
the  equivalence 

P  [We  >  x  +  Tk\  ~  P  [Wt  >  x] ,  £  €  Hk 

since  W(  €  C  for  l  in  Hk  and  the  rvs  W (  and  Tk  are 
independent.  It  is  now  a  simple  matter  to  conclude  via  (109) 
that 

P  \wk  >x  +  Tk]  ~  P[Wi  >*],  k£H.  (110) 
eenk 

Combining  (103)  and  (110)  yields 

P  [R  >  x]  ~  E^E  P  [Wi  >  x] 
ke£  ee n 

+  E  Pk  E  P  >  A 

ken  ienk 

~  (E^J  (Ep[^>a;] 

\kes  )  Keen 

+  E  pk  ( E  P  [Wt  >  x]  —  P  [Wfc  >  x] 
ken  Keen 

~  (Epfc  +  Epfe)  E11^  >a;] 

\ke£  ken  J  ien 

-  ^2  VkP  [Wh  >  x] 
ken 

and  the  conclusion  (77)  follows.  □ 


